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Abstract: In the usual factorization theorems, which give predictions only for in- 
clusive cross sections, there is considerable freedom in the choice of the scheme to 
define the parton distribution functions. These theorems do not directly apply to 
Monte-Carlo event generators, and more general factorization theorems which give 
predictions for fully exclusive cross sections are needed. It has been shown that 
appropriate parton distribution functions are uniquely defined by the showering al- 
gorithm. In this paper, we present results of calculations of the Monte-Carlo parton 
distribution functions in terms of the commonly used MS parton distribution func- 
tions. At small x the differences are large, which demonstrates the importance of 
using the correct parton distribution functions in an event generator rather than MS 
parton distribution functions. We present some simple approximations that enable 
an understanding of the sizes of the results to be obtained. 
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1. Introduction 

The standard factorization theorems Q of QCD only give predictions for inclusive 
cross sections. In Monte-Carlo (MC) event generators we generate complete events 
and implement QCD predictions for the detailed structure of the final state, therefore 
it is important to get both the inclusive cross section and the exclusive cross sections 
right. 

For inclusive cross sections, there is freedom in choosing the factorization scheme 
that defines the parton distribution functions (pdfs). But one of us has shown that 
this is not the case in MC event generators; the specific showering algorithm used in 
a particular event generator entails a particular definition of the pdfs that should be 
used in the event generator. 1 In this paper we first expand these arguments, showing 
how they are related to the requirement of obtaining correct exclusive cross sections. 

We then present and analyze the results of calculations for pdfs that are ap- 
propriate to the algorithm of Bengtsson, Sjostrand and van Zijl [[|, as is used, for 
example, in the event generators PYTHIA and RAPGAP. In order to reach the 
next-to-leading order (NLO) accuracy in both the inclusive and the exclusive cross 
sections, it is important to use the correct pdfs and the correspondingly determined 
NLO hard scattering coefficients. As an example, we show that in the DIS F 2 cal- 
culation, using MS pdfs instead of the correct ones for the specific event generator 
introduces an error of roughly 40% at small x. This can, of course, substantially 
affect the phenomenology. 

x See the Note Added at the end of the paper for earlier work on the same idea. 
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2. Factorization Schemes and Parton Distribution Functions 
in Monte-Carlo event generators 

2.1 Factorization theorem and Monte-Carlo event generators 

The factorization theorem states that appropriate inclusive cross sections with a large 
transverse momentum Q are given |IJ (to the leading power in Q) by a hard scattering 
coefficient convoluted with pdfs. Each hard scattering coefficient is infrared safe, 
calculable in perturbation theory and independent of the external hadron. The pdfs 
contain all the infrared sensitivity of the original cross section, they are external- 
hadron specific and are independent of the particular hard scattering process. 

For inclusive cross sections, it is well known that there is some freedom in choos- 
ing the prescription by which the pdfs are defined. A set of rules that makes the 
choices is called a 'factorization scheme'. Such a scheme both defines the pdfs and 
implies a rule for unambiguously calculating the hard scattering coefficients. 

It might be concluded that this also applies to the hard scattering coefficients in 
an MC event generator. As shown by one of us [0, this is not in fact the case, and 
we will now review the reasoning. 2 

An MC event generator calculates the exclusive components of the cross section, 
by using a combination of perturbatively calculated quantities for the larger scales 
and suitable modeling for the nonperturbative physics. The perturbative part con- 
sists of hard-scattering coefficients and evolution kernels. For the idea of computing 
NLO (and even higher order) corrections to make practical sense, the perturbative 
expansions of the hard scattering coefficients and the evolution kernels must be free 
of logarithms of large ratios of kinematic variables. 




J 

Figure 1: Two-parton correlation function used in definition of parton distribution func- 
tion. 

Since the cross section being computed is exclusive rather than inclusive, the rea- 
soning leading to the factorization theorem for inclusive processes does not directly 
apply, and more general arguments are mandatory |2|, ||. Now a pdf fi(x) is essen- 
tially the number density of partons of flavor i and fractional longitudinal momentum 

2 This implies that we disagree with the reasoning about NLO corrections to event generators 
that assumes an opposite conclusion, as in Refs. 
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x with an integral over transverse momentum and virtuality. An examination of 
the derivation of the event generator algorithms shows that the form of this integral 
and its cutoff at large transverse momentum and virtuality are determined by the 
showering algorithm. For example, if the Bengtsson-Sjostrand algorithm of Ref. M 
is used for the kinematics of the initial-state parton, then the pdf is 

fi(x) = J d 4 kd(x - k ■ (k + q)/p- q)6(Q 2 - \k 2 \) L(k,p), (2.1) 

where L(k,p) is a two-particle correlation function of two partons in the target 
hadron, Fig. [I]. The delta function gives the definition of the longitudinal fractional 
momentum variable in the algorithm, and the theta function implements the upper 
cutoff on virtuality. Some details concerning gauge invariance have not been pre- 
cisely specified, but for our purposes this will not matter. In any case the definition 
is different from the MS definition, and the specification of the showering algorithm 
implies a specific prescription for defining the pdfs, without any further choice. 

In an event generator, the definitions of the parton kinematics in an initial-state 
shower are phenomenologically manifested in the kinematics of the hadronic final 
state. Hence if the parton kinematics are mismatched between the parton-shower 
algorithm and the definition of the pdf, the calculation of the final state is incorrect. 
This contrasts with the calculation of an inclusive cross section, where the relevant 
part of the final state is summed over; all that matters in an inclusive cross section 
is that given the kinematics for the struck parton a final state is generated with 
probability unity. 

As usual, at lowest order it is legitimate to approximate the pdfs in the correct 
scheme by those in some other conveniently chosen scheme (e.g., MS). This is because 
the error caused by the incorrect pdf is of the same order as the error caused by the 
unimplemented NLO correction in the hard scattering. But beyond LO, this is not 
an appropriate approximation. 

The methods of, for example, Potter ||, suggest that the scheme for the pdfs 
can be chosen at will, with the scheme dependence of the pdfs being compensated for 
in the computation of the hard scattering. This reasoning appears to be incorrect. 
It starts from the assertion that the cross section is given by ('bare') pdfs convo- 
luted with on-shell partonic matrix elements computed without any subtractions. 
Although this statement is often repeated in the literature, we know of no proof. 
Indeed it is a clearly unphysical statement, since in the real world of QCD partons 
are never on-shell. Moreover it is not necessary |7| for a correct proof of factoriza- 
tion. The incorrect starting assumption is particularly inappropriate for work with 
an event generator, where one explicitly treats the showering of partons that have 
much lower virtuality than that largest scale in the process. Furthermore, as we 
stated above, the standard factorization theorem and its derivation are not sufficient 
by themselves to derive an algorithm for parton showering, at NLO accuracy. 
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Moreover, the method of Potter results in the real-emission part of the NLO hard 
scattering being obtained from unsubtracted partonic scattering graphs integrated 
down to a small cutoff s m i n . Real emission at NLO below the cutoff is assigned to 
the same parton configuration as the LO term; for this to be a useful approximation, 
s m i n must be substantially smaller than the primary scale Q 2 of the hard scattering. 
This immediately implies that there is a double logarithm of the ratio s m \ n /Q 2 in the 
integrated hard scattering coefficient. Since there will be corresponding logarithms 
in higher orders, this implies that the NLO hard scattering in this method is an 
inappropriate way of truncating a perturbation expansion. 

Similar remarks apply to other proposals along similar lines, for example that of 
Dobbs M. 



2.2 Pdfs in MC event generators 

In 0, a subtraction method was introduced to consistently take into account the 
LO and the NLO contributions for DIS in PYTHIA. Two different algorithms were 
discussed and formulas for the appropriate pdfs in terms of MS pdfs were derived 
at the level of gluon-induced NLO terms. The gluon-induced term is particularly 
important because of the large size of the gluon distribution functions at small x. 
The algorithms are the standard Bengtsson-Sjostrand algorithm H and a modified 
algorithm of Collins || which has improved kinematic properties. We label these 
algorithms "BS" and "JCC". 

In this section we review the derivation, which is made by comparing calcula- 
tions of the contribution of quark a to F 2 in the MS scheme with the corresponding 
calculation in each of the event-generator algorithms: 
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Here z = x/£ while P(z) = j[z 2 + (1 — z) 2 ] is the splitting kernel for gluon - 
quark- ant iquark pair. In the formula for the BS algorithm, X\ — x — |£(1 — cosi 
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and — t = Q 2 (l — cos9)^/2x, while C{—t) = 9(Q 2 + i) is the cutoff function for the 
showering. 

We define F^^, F 2 L ^ S and F^fcc ^° ^ e the nrs ^ terms on the right of each of 
Eqs. ( |2.2| ), (|2.3|) and (|2.4j) respectively. Similarly the second terms are called F^-^S, 
F^bs and respectively. 

Formulas follow immediately [0, |8| for the quark distribution function in the BS 
scheme and the JCC scheme in terms of those in the MS scheme: 

+ ^ [ dt-^it, » 2 ) [P{z) ln(l -z) + z(l - z)] 

+ NLO quark terms + 0(a 2 s ) (2.5) 
= xf^(x, /i 2 ) + if^§ - F 2 N j L c ° c + NLO quark terms + 0(a 2 s ). 
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+ NLO quark terms + 0(a 2 ), (2.6) 
= xf( m \x, ii 2 ) + F 2 N ^§ - F 2 N £ S ° + NLO quark terms + 0(a 2 ). 

Note that Eq. fl2.6|) is a nonlinear equation in terms of f® s ] this arises from the 
particular treatment of parton kinematics in the BS algorithm. When we calculate 
the numerical value of f® s , we will use in the integrand. We will justify this 
simplification in Sec. 2~A . 

We have performed numerical calculations of the quark densities. Our code is 
based on earlier work by Sabine Schilling ||. We have made the code available at 
I plH . Some results are shown in Fig. which gives the u quark distribution function 
in the MS and BS schemes at Q 2 = 625 GeV 2 , with the MS density being that of 
CTEQ5 HTT|| . This figure clearly shows a large difference between the BS pdf and 
the MS pdf at small x. The curves for the <i-quark distribution function are quite 
similar, so we do not show them. In Sees. [2.3| and |2.4| , we will analyze the scheme 
dependence of the pdfs in more detail. 

If we use MS pdfs rather than the ones appropriate to the BS algorithm of the 
event generator, the exclusive cross section will be in error by ~ 40% at small x, 
although it is possible to get the correct inclusive cross section, with the use of the 
well-known NLO correction in this scheme. As we will explain, this correction is 
unusually small, so that good results can be obtained for the inclusive cross section, 
i.e., for Fo, even without the use of the NLO correction, if the MS scheme is used. 
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We conclude that, while MS pdfs 
are well-defined quantities and are ap- 
propriately used in calculations of in- 
clusive cross sections, they are not suit- 
able for use in MC event generators 
where the fully exclusive cross sections 
are our main concern. 

For the same reason, the pdfs used 
in ||, U which are essentially MS pdfs 
with "scheme-dependence" corrections, 
are not appropriate pdfs to use in the 
event generators. 

2.3 The NLO contributions to F 2 

In this section we investigate the rela- 
tive size of the NLO and LO contribu- 
tions to F2, with the aid of some use- 
ful approximations, and we show that 
the substantial differences between the 
schemes are to be expected, because of 
the large size of the gluon distribution function. A surprising result is that the NLO 
corrections to F 2 in the MS scheme are unusually small, as the result of special 
cancellations. 

From Eqs. ( |2.2j ), ( |2.3| ) and ( |2.4| ), the relative sizes of the NLO terms can be 
estimated as follows: 

~ ^hVx)IUx). (2.7) 

This estimate is appropriate to the small x region. We have first observed that each 
integral contains a factor a s /2ir, a factor of the gluon distribution function and a 
factor of order unity. At small x, the important values of £ range from x to a modest 
factor times x, so that we set the argument of the gluon distribution function to 2x, 
as an estimate of the typical value of £. 

The ratio of NLO to LO would generally be at most of order a s /27c, which is 
appropriate for a generic NLO correction, were it not that the gluon distribution 
function is large at small x. Clearly, there should be an enhancement of the NLO 
contribution, and the above formula gives the expected size. 

Fig. [3] displays the numerical value of the ratio of the NLO and the LO terms 
in F2 for different factorization schemes at Q 2 = 33.8 GeV 2 , in the case of the u 
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Figure 2: The left-hand graph gives the 
u-quark distribution at small x at Q 2 = 
625 GeV 2 . The lines are simple power law 
fits to the curves: xu(x) = 0.135x~ 0,32 and 
xu(x) = 0.14x -0 ' 27 for the upper (MS) and 
lower (BS scheme) curves. The inset gives the 
pdf at all x. 
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NLO/LO BS scheme 

NLO/LO MS scheme 

Estimated ratio 



quark. The figure also show the results for the simple estimate (|2.7| ). Results for 
other flavors would be similar. 

For the BS scheme, the large gluon distribution function at small x does indeed 
give a substantial effect: the NLO contribution is close to 100% of the LO contri- 
bution at x = 10 -4 . The effect is much smaller at large x. The plot of the simple 
estimate ( |2.7| ) shows that this behavior is completely expected. The accuracy of the 
approximation is an accident, but the overall size is not. 

For the commonly used MS scheme, 
the NLO corrections are rather small for 
all x. In view of the expected size of 
generic NLO corrections, we should re- 
gard the small size of the correction in 
MS as an accident that is useful for phe- 
nomenology rather than fundamen- 
tal expectation. The cancellation is as- 
sociated with the particular sizes of the 
positive and negative terms in the second 
line of Eq. Q. 

To understand this cancellation bet- 
ter, it is convenient to perform a slightly 
different approximation where we replace 
the pdfs by power laws. At small x, the 
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Figure 3: Ratio of the NLO and LO terms 
in i*2 from u quark contribution for differ- 
ent factorization schemes. We also show the 
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In each case, the lower limit of the z integral can be set to zero when x is small, and 
we therefore have a simple power of x times a fixed integral over z. 
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The integrals of z in the above equations are basically all of 0(1). The exponent 
q is about 1.2 ~ 1.4 for the gluon distribution function. The exponent r for the 
quark distribution is close to 1 and less than q. Therefore, the NLO corrections will 
be enhanced at small x. However there are negative terms in the MS integral which 
results in a cancellation. There is a weaker cancellation in the integral for the JCC 
scheme, but there is no cancellation in the integral for the BS scheme. 

2.4 Accuracy of nonlinear term in BS pdfs 

The BS quark distribution function is related to the MS distribution function by the 
nonlinear integral shown in Eq. (|2.6|) . As is normal, we replace the BS pdfs on the 
right-hand side by the MS pdfs, so that we get a formula involving MS pdfs only. 
Generally this is the normal procedure since the change involves an effect of relative 
order a s in an NLO term. In transformations that are linear in the pdfs, this is quite 
sensible: All the errors are handled by the uncalculated terms of yet higher order. 
However, this is more delicate for the nonlinear formula, particularly given that the 
quark distribution functions have large corrections. A linear formula, as for the JCC 
scheme, only needs the gluon distribution function, but our nonlinear formula also 
involves the quark distribution functions. 

In this section we will show that this issue does not in fact affect the accuracy of 
our calculations, since the right-hand side of Eq. ( |2.6| ) only involves a ratio of quark 
distribution functions; the ratio of BS quark distribution functions, fa S ( x )/fa S ( x i)i 
is equal to the ratio of the MS distribution functions to a good approximation. 

Our demonstration is semi-analytic, so that we can see that the result is robust 
against changes in the pdfs. 

In Fig. |2|, we fit the small x pdfs with y = Ax B . The exponent B of x depends 
on Q 2 , but the difference of exponent between the BS scheme and the MS scheme is 
roughly the same for all Q 2 and approximately equal to 0.05. 

The error introduced by the replacement of the BS quark distribution functions 
by the MS distribution functions on the right-hand side of Eq. (|2.6|) is then 



Z7T JO Z \Zj J\-2z 1 — COS# 



f^(x) f™(x) 



, (2.11) 



where Z\ — z — (1 — cos#)/2. 

The error 5 is the largest in small x region because of the large difference between 
the MS pdfs and the BS pdfs. When cos 9^ l—2z,x\ — > 0, and then f a (x)/f a (xi) — > 
0, therefore 5 is very small. When cos 9 — > 1, x\ ~ x, we have 

l-|(l-ooBe)^ (2.12) 

Ja{Xl) 2 f a (x) 

and f^(x) ~ Ax/x 1 ' 32 , / a BS (x) ~ A 2 /x 127 . 
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We can see that 5 depends on the pdfs through their difference in exponents of 
x, rather than on the actual value of pdfs. Given the exponents of the BS pdf and 
the MS pdf, we have, 

8 « ^ / dz-U-)P{z)/z x 0.025 (2.13) 
2ti Jo z z 

which is less than 5% of F^s- Therefore, our simplification is valid up to NLO 
accuracy. 



3. Conclusion 

We explained that the pdfs in MC event generators are determined by the showering 
algorithm, and cannot be freely chosen, unlike the case for pdfs used in inclusive 
calculations. The rules for calculating the hard-scattering coefficients at higher orders 
are then unambiguously defined. We then presented some numerical calculations of 
the quark distribution functions to be used with the BS algorithm. At small x the 
corrections are large, so that the commonly used MS pdfs are inappropriate for use 
in event generators. We used some simple approximations to understand the size of 
the corrections and to show that the large correction is to be expected. 



The code for the MC-specific pdfs is available at [JTQ | . 



Note Added 



Early papers on HERWIG, for example the paper of Marchesini and Webber |12| , also 
mentioned the idea that the showering algorithm entails a particular definition of the 
pdfs, with cutoffs on parton kinematics that correspond to cutoffs in the showering. 
However, since the event generator was only implemented at leading order, the need 
for modified pdfs was not emphasized in Ref. ||12|| ; it was sufficient to use unmodified 
pdfs from standard fits. 

We thank the referee for bringing this work to our attention. 
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